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□Trade-off between area and sequentially 

■ Basic operation of compiler technology can be used to achieve abstraction 

• Generate .compressed' configurations, which are expanded on larger 

arrays while loading 
- No minimum array but maximum array defined by compiler setting 
. Tradeoff. Number of (re-)configurations vs. usable ALU-PAEs 





□ Basic Operation Method 

■ LOAD/STORE processor 

■ RAM-PAEs act as Vector-Registers (2D/3D) 

■ Irregular data access patterns are linearized by 
LOAD/STORE while accessing RAM-PAEs 

• Can be done by uP also! 
- LOAD ... Conf, ... Conf 2 ... .... Conf„... STORE 

■ Each Configuration is regarded as an OpCode 

■ No Configuration/Array internal status 



□ Code Analysis 

■ Data Dependency Analysis 

■ Data Flow Analysis 

■ Interprocedural Alias Analysis 

• Pointer analysis: statically allocated data, dynamically allocated 

" Interprocedural Value Range Analysis 



□ Code Optimizations 
PACT , Loop Transformations 

• Loop Normalization 

• Loop Reversal 

• Loop-Invariant Code Motion 

• Loop Unswitching 

• Loop Interchange 

• Loop Tiling 

• Loop Skewing 
. Loop Coalescing/Collapsing 

• Loop Fusion 

• Loop Distribution 

• Loop Unrolling 

• Loop Peeling 

• Loop Splitting 

• Loop Pushing/Embedding 



Y'PP 



Strength Reduction 

Induction Variable Semination 
■ Strip Mining 
' Scalar Expansion 
> Array Contracting/Shrinking 

• Scalar Replacement 

• Reduction Recognition 

• Idiom Recognition 

• Procedure Mining 

• Software Pipelining 

. Vector Statement Generation 

• Node Splitting 

• If Conversion 

. statement Reordering 
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□ RAM-PAEs RAM is ..embedded" into cache 

□ RAM-PAEs can operate like cache-lines 

• Homogeneous embedded in cache 

■ Handling access rights between uP and XPP 

■ Handling context switching / hyperthreading 

■ Abstracting non linear address patterns 
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□ XPP operates like an RISC-Processor 

□ RAM-PAEs act like registers 

□ Each configuration is atomar (unbreakable) 

□ Configurations running time is limited 

□ LOAD Configuration 

• Loads external data into internal RAM-PAEs 

□ Data operations (one or multiple configurations) 

■ Unbreakable - no internal status to be saved! 

□ STORE Configuration 

■ Stores internal data into external RAM-PAEs 

□ Interrupts (Task/Thread-Switches) only between (re)configurations not 
at runtime 




I5S1 QXPP Technology allows sequential processing 

^RACrS " Within ALU-PAEs using the configuration register file as a random access 

code memory 

- . ^ - - Coupling an ALU-PAE with a RAM-PAE. ALU-PAE acts like a uC, RAM- 

PAE is according Data- and Code-Memory 
• As an enhancement lOPAEs can be used to access peripherials and external 
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□ Handled by sequential processing within PAEs 

■ I.e. Floating-Point, Division etccan be emulated by sequential 

multicycle PAE operations 
- Higher precision is calculated as a multicycle operation 
• results are transfered in two bus cycles 
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